Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 4745 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 42 |
| Duplicate rows (%) | 0.9% |
| Total size in memory | 1.1 MiB |
| Average record size in memory | 235.8 B |
Variable types
| Text | 7 |
|---|---|
| Numeric | 16 |
| Categorical | 2 |
| Dataset has 42 (0.9%) duplicate rows | Duplicates |
content_rating is highly imbalanced (50.8%) | Imbalance |
budget is highly skewed (γ1 = 49.20095838) | Skewed |
director_fb_likes has 835 (17.6%) zeros | Zeros |
actor_3_fb_likes has 66 (1.4%) zeros | Zeros |
facenumber_in_poster has 2035 (42.9%) zeros | Zeros |
movie_fb_likes has 2104 (44.3%) zeros | Zeros |
Reproduction
| Analysis started | 2024-04-10 12:32:52.580708 |
|---|---|
| Analysis finished | 2024-04-10 12:34:50.520855 |
| Duration | 1 minute and 57.94 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
director_name
Text
| Distinct | 2244 |
|---|---|
| Distinct (%) | 47.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 203.2 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 24 |
| Mean length | 13.079663 |
| Min length | 3 |
Characters and Unicode
| Total characters | 62063 |
|---|---|
| Distinct characters | 76 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1371 ? |
|---|---|
| Unique (%) | 28.9% |
Sample
| 1st row | James Cameron |
|---|---|
| 2nd row | Gore Verbinski |
| 3rd row | Sam Mendes |
| 4th row | Christopher Nolan |
| 5th row | Andrew Stanton |
| Value | Count | Frequency (%) |
| john | 175 | 1.8% |
| david | 145 | 1.5% |
| michael | 124 | 1.3% |
| james | 85 | 0.9% |
| peter | 84 | 0.9% |
| robert | 81 | 0.8% |
| richard | 79 | 0.8% |
| paul | 78 | 0.8% |
| scott | 65 | 0.7% |
| lee | 57 | 0.6% |
| Other values (2797) | 8889 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5874 | 9.5% |
| 5117 | 8.2% | |
| a | 5022 | 8.1% |
| n | 4474 | 7.2% |
| r | 4280 | 6.9% |
| o | 3657 | 5.9% |
| i | 3532 | 5.7% |
| l | 2854 | 4.6% |
| t | 2236 | 3.6% |
| s | 2001 | 3.2% |
| Other values (66) | 23016 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 62063 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 5874 | 9.5% |
| 5117 | 8.2% | |
| a | 5022 | 8.1% |
| n | 4474 | 7.2% |
| r | 4280 | 6.9% |
| o | 3657 | 5.9% |
| i | 3532 | 5.7% |
| l | 2854 | 4.6% |
| t | 2236 | 3.6% |
| s | 2001 | 3.2% |
| Other values (66) | 23016 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 62063 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 5874 | 9.5% |
| 5117 | 8.2% | |
| a | 5022 | 8.1% |
| n | 4474 | 7.2% |
| r | 4280 | 6.9% |
| o | 3657 | 5.9% |
| i | 3532 | 5.7% |
| l | 2854 | 4.6% |
| t | 2236 | 3.6% |
| s | 2001 | 3.2% |
| Other values (66) | 23016 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 62063 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 5874 | 9.5% |
| 5117 | 8.2% | |
| a | 5022 | 8.1% |
| n | 4474 | 7.2% |
| r | 4280 | 6.9% |
| o | 3657 | 5.9% |
| i | 3532 | 5.7% |
| l | 2854 | 4.6% |
| t | 2236 | 3.6% |
| s | 2001 | 3.2% |
| Other values (66) | 23016 |
num_critic_for_reviews
Real number (ℝ)
| Distinct | 527 |
|---|---|
| Distinct (%) | 11.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 146.08662 |
| Minimum | 1 |
|---|---|
| Maximum | 813 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 13 |
| Q1 | 57 |
| median | 117 |
| Q3 | 201 |
| 95-th percentile | 391 |
| Maximum | 813 |
| Range | 812 |
| Interquartile range (IQR) | 144 |
Descriptive statistics
| Standard deviation | 121.11349 |
|---|---|
| Coefficient of variation (CV) | 0.8290526 |
| Kurtosis | 2.8862095 |
| Mean | 146.08662 |
| Median Absolute Deviation (MAD) | 68 |
| Skewness | 1.5052961 |
| Sum | 693181 |
| Variance | 14668.478 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 81 | 33 | 0.7% |
| 43 | 30 | 0.6% |
| 63 | 29 | 0.6% |
| 25 | 29 | 0.6% |
| 46 | 28 | 0.6% |
| 16 | 28 | 0.6% |
| 97 | 28 | 0.6% |
| 50 | 28 | 0.6% |
| 112 | 28 | 0.6% |
| 29 | 28 | 0.6% |
| Other values (517) | 4456 |
| Value | Count | Frequency (%) |
| 1 | 22 | |
| 2 | 17 | |
| 3 | 9 | 0.2% |
| 4 | 14 | |
| 5 | 26 | |
| 6 | 15 | |
| 7 | 18 | |
| 8 | 25 | |
| 9 | 24 | |
| 10 | 26 |
| Value | Count | Frequency (%) |
| 813 | 1 | |
| 775 | 1 | |
| 765 | 1 | |
| 750 | 2 | |
| 739 | 1 | |
| 738 | 1 | |
| 733 | 1 | |
| 723 | 1 | |
| 712 | 1 | |
| 703 | 2 |
duration
Real number (ℝ)
| Distinct | 164 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108.62044 |
| Minimum | 14 |
|---|---|
| Maximum | 330 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 14 |
|---|---|
| 5-th percentile | 84 |
| Q1 | 94 |
| median | 104 |
| Q3 | 118 |
| 95-th percentile | 146 |
| Maximum | 330 |
| Range | 316 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 22.523631 |
|---|---|
| Coefficient of variation (CV) | 0.20736089 |
| Kurtosis | 11.772115 |
| Mean | 108.62044 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 2.226497 |
| Sum | 515404 |
| Variance | 507.31396 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 143 | 3.0% |
| 100 | 136 | 2.9% |
| 101 | 133 | 2.8% |
| 98 | 130 | 2.7% |
| 97 | 125 | 2.6% |
| 93 | 122 | 2.6% |
| 95 | 121 | 2.6% |
| 99 | 120 | 2.5% |
| 94 | 120 | 2.5% |
| 106 | 109 | 2.3% |
| Other values (154) | 3486 |
| Value | Count | Frequency (%) |
| 14 | 1 | |
| 20 | 1 | |
| 25 | 1 | |
| 34 | 1 | |
| 37 | 1 | |
| 41 | 1 | |
| 45 | 2 | |
| 46 | 1 | |
| 47 | 1 | |
| 53 | 1 |
| Value | Count | Frequency (%) |
| 330 | 1 | |
| 325 | 1 | |
| 300 | 1 | |
| 293 | 1 | |
| 289 | 1 | |
| 280 | 1 | |
| 271 | 1 | |
| 270 | 1 | |
| 251 | 2 | |
| 240 | 2 |
director_fb_likes
Real number (ℝ)
ZEROS 
| Distinct | 429 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 707.98778 |
| Minimum | 0 |
|---|---|
| Maximum | 23000 |
| Zeros | 835 |
| Zeros (%) | 17.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 8 |
| median | 52 |
| Q3 | 209 |
| 95-th percentile | 1000 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 201 |
Descriptive statistics
| Standard deviation | 2853.5003 |
|---|---|
| Coefficient of variation (CV) | 4.0304373 |
| Kurtosis | 26.160629 |
| Mean | 707.98778 |
| Median Absolute Deviation (MAD) | 52 |
| Skewness | 5.1294727 |
| Sum | 3359402 |
| Variance | 8142464.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 835 | 17.6% |
| 3 | 65 | 1.4% |
| 6 | 61 | 1.3% |
| 7 | 58 | 1.2% |
| 11 | 57 | 1.2% |
| 2 | 56 | 1.2% |
| 4 | 54 | 1.1% |
| 10 | 51 | 1.1% |
| 12 | 48 | 1.0% |
| 5 | 48 | 1.0% |
| Other values (419) | 3412 |
| Value | Count | Frequency (%) |
| 0 | 835 | |
| 2 | 56 | 1.2% |
| 3 | 65 | 1.4% |
| 4 | 54 | 1.1% |
| 5 | 48 | 1.0% |
| 6 | 61 | 1.3% |
| 7 | 58 | 1.2% |
| 8 | 47 | 1.0% |
| 9 | 46 | 1.0% |
| 10 | 51 | 1.1% |
| Value | Count | Frequency (%) |
| 23000 | 1 | < 0.1% |
| 22000 | 8 | 0.2% |
| 21000 | 10 | 0.2% |
| 18000 | 4 | 0.1% |
| 17000 | 20 | |
| 16000 | 28 | |
| 15000 | 2 | < 0.1% |
| 14000 | 30 | |
| 13000 | 26 | |
| 12000 | 17 |
actor_3_fb_likes
Real number (ℝ)
ZEROS 
| Distinct | 905 |
|---|---|
| Distinct (%) | 19.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 667.40506 |
| Minimum | 0 |
|---|---|
| Maximum | 23000 |
| Zeros | 66 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 13 |
| Q1 | 141 |
| median | 384 |
| Q3 | 642 |
| 95-th percentile | 1000 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 501 |
Descriptive statistics
| Standard deviation | 1708.8646 |
|---|---|
| Coefficient of variation (CV) | 2.5604609 |
| Kurtosis | 57.241954 |
| Mean | 667.40506 |
| Median Absolute Deviation (MAD) | 251 |
| Skewness | 7.0857549 |
| Sum | 3166837 |
| Variance | 2920218.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 124 | 2.6% |
| 0 | 66 | 1.4% |
| 11000 | 29 | 0.6% |
| 2000 | 27 | 0.6% |
| 3000 | 26 | 0.5% |
| 3 | 23 | 0.5% |
| 826 | 22 | 0.5% |
| 7 | 21 | 0.4% |
| 249 | 19 | 0.4% |
| 4 | 18 | 0.4% |
| Other values (895) | 4370 |
| Value | Count | Frequency (%) |
| 0 | 66 | |
| 2 | 16 | 0.3% |
| 3 | 23 | 0.5% |
| 4 | 18 | 0.4% |
| 5 | 12 | 0.3% |
| 6 | 17 | 0.4% |
| 7 | 21 | 0.4% |
| 8 | 15 | 0.3% |
| 9 | 14 | 0.3% |
| 10 | 12 | 0.3% |
| Value | Count | Frequency (%) |
| 23000 | 2 | < 0.1% |
| 20000 | 1 | < 0.1% |
| 19000 | 5 | 0.1% |
| 17000 | 1 | < 0.1% |
| 16000 | 3 | 0.1% |
| 15000 | 1 | < 0.1% |
| 14000 | 6 | 0.1% |
| 13000 | 5 | 0.1% |
| 12000 | 8 | 0.2% |
| 11000 | 29 |
actor_2_name
Text
| Distinct | 2824 |
|---|---|
| Distinct (%) | 59.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 203.2 KiB |
Length
| Max length | 28 |
|---|---|
| Median length | 25 |
| Mean length | 13.069547 |
| Min length | 3 |
Characters and Unicode
| Total characters | 62015 |
|---|---|
| Distinct characters | 78 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1915 ? |
|---|---|
| Unique (%) | 40.4% |
Sample
| 1st row | Joel David Moore |
|---|---|
| 2nd row | Orlando Bloom |
| 3rd row | Rory Kinnear |
| 4th row | Christian Bale |
| 5th row | Samantha Morton |
| Value | Count | Frequency (%) |
| michael | 94 | 1.0% |
| david | 54 | 0.6% |
| john | 53 | 0.5% |
| james | 50 | 0.5% |
| tom | 48 | 0.5% |
| scott | 48 | 0.5% |
| jason | 43 | 0.4% |
| robert | 42 | 0.4% |
| kevin | 39 | 0.4% |
| bruce | 36 | 0.4% |
| Other values (3619) | 9306 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5867 | 9.5% |
| a | 5568 | 9.0% |
| 5068 | 8.2% | |
| n | 4486 | 7.2% |
| r | 4154 | 6.7% |
| i | 3805 | 6.1% |
| o | 3427 | 5.5% |
| l | 3232 | 5.2% |
| t | 2210 | 3.6% |
| s | 2039 | 3.3% |
| Other values (68) | 22159 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 62015 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 5867 | 9.5% |
| a | 5568 | 9.0% |
| 5068 | 8.2% | |
| n | 4486 | 7.2% |
| r | 4154 | 6.7% |
| i | 3805 | 6.1% |
| o | 3427 | 5.5% |
| l | 3232 | 5.2% |
| t | 2210 | 3.6% |
| s | 2039 | 3.3% |
| Other values (68) | 22159 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 62015 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 5867 | 9.5% |
| a | 5568 | 9.0% |
| 5068 | 8.2% | |
| n | 4486 | 7.2% |
| r | 4154 | 6.7% |
| i | 3805 | 6.1% |
| o | 3427 | 5.5% |
| l | 3232 | 5.2% |
| t | 2210 | 3.6% |
| s | 2039 | 3.3% |
| Other values (68) | 22159 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 62015 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 5867 | 9.5% |
| a | 5568 | 9.0% |
| 5068 | 8.2% | |
| n | 4486 | 7.2% |
| r | 4154 | 6.7% |
| i | 3805 | 6.1% |
| o | 3427 | 5.5% |
| l | 3232 | 5.2% |
| t | 2210 | 3.6% |
| s | 2039 | 3.3% |
| Other values (68) | 22159 |
actor_1_fb_likes
Real number (ℝ)
| Distinct | 843 |
|---|---|
| Distinct (%) | 17.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6821.6906 |
| Minimum | 0 |
|---|---|
| Maximum | 640000 |
| Zeros | 14 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 117 |
| Q1 | 638 |
| median | 1000 |
| Q3 | 11000 |
| 95-th percentile | 24000 |
| Maximum | 640000 |
| Range | 640000 |
| Interquartile range (IQR) | 10362 |
Descriptive statistics
| Standard deviation | 14943.707 |
|---|---|
| Coefficient of variation (CV) | 2.1906164 |
| Kurtosis | 722.02214 |
| Mean | 6821.6906 |
| Median Absolute Deviation (MAD) | 792 |
| Skewness | 19.530386 |
| Sum | 32368922 |
| Variance | 2.2331439 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 421 | 8.9% |
| 11000 | 209 | 4.4% |
| 2000 | 190 | 4.0% |
| 3000 | 151 | 3.2% |
| 12000 | 135 | 2.8% |
| 13000 | 126 | 2.7% |
| 14000 | 122 | 2.6% |
| 10000 | 109 | 2.3% |
| 18000 | 109 | 2.3% |
| 22000 | 80 | 1.7% |
| Other values (833) | 3093 |
| Value | Count | Frequency (%) |
| 0 | 14 | |
| 2 | 6 | |
| 3 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 4 | 0.1% |
| 6 | 3 | 0.1% |
| 7 | 2 | < 0.1% |
| 9 | 3 | 0.1% |
| 11 | 2 | < 0.1% |
| 12 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 640000 | 1 | < 0.1% |
| 260000 | 2 | < 0.1% |
| 164000 | 2 | < 0.1% |
| 137000 | 2 | < 0.1% |
| 87000 | 8 | 0.2% |
| 77000 | 1 | < 0.1% |
| 49000 | 27 | |
| 46000 | 1 | < 0.1% |
| 45000 | 5 | 0.1% |
| 44000 | 2 | < 0.1% |
gross
Real number (ℝ)
| Distinct | 4146 |
|---|---|
| Distinct (%) | 87.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45202964 |
| Minimum | 162 |
|---|---|
| Maximum | 7.6050585 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 162 |
|---|---|
| 5-th percentile | 100751 |
| Q1 | 6531491 |
| median | 24848292 |
| Q3 | 54557348 |
| 95-th percentile | 1.7096688 × 108 |
| Maximum | 7.6050585 × 108 |
| Range | 7.6050568 × 108 |
| Interquartile range (IQR) | 48025857 |
Descriptive statistics
| Standard deviation | 64598174 |
|---|---|
| Coefficient of variation (CV) | 1.4290694 |
| Kurtosis | 17.370468 |
| Mean | 45202964 |
| Median Absolute Deviation (MAD) | 20807704 |
| Skewness | 3.3896832 |
| Sum | 2.1448806 × 1011 |
| Variance | 4.1729241 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24848292 | 461 | 9.7% |
| 5000000 | 4 | 0.1% |
| 34964818 | 3 | 0.1% |
| 3000000 | 3 | 0.1% |
| 144512310 | 3 | 0.1% |
| 5773519 | 3 | 0.1% |
| 218051260 | 3 | 0.1% |
| 177343675 | 3 | 0.1% |
| 7000000 | 3 | 0.1% |
| 47000000 | 3 | 0.1% |
| Other values (4136) | 4256 |
| Value | Count | Frequency (%) |
| 162 | 1 | |
| 423 | 1 | |
| 607 | 1 | |
| 703 | 1 | |
| 721 | 1 | |
| 728 | 1 | |
| 828 | 1 | |
| 1029 | 1 | |
| 1100 | 1 | |
| 1111 | 1 |
| Value | Count | Frequency (%) |
| 760505847 | 1 | |
| 658672302 | 1 | |
| 652177271 | 1 | |
| 623279547 | 2 | |
| 533316061 | 1 | |
| 474544677 | 1 | |
| 460935665 | 1 | |
| 458991599 | 1 | |
| 448130642 | 1 | |
| 436471036 | 1 |
genres
Text
| Distinct | 878 |
|---|---|
| Distinct (%) | 18.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 203.2 KiB |
Length
| Max length | 64 |
|---|---|
| Median length | 53 |
| Mean length | 20.553214 |
| Min length | 5 |
Characters and Unicode
| Total characters | 97525 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 478 ? |
|---|---|
| Unique (%) | 10.1% |
Sample
| 1st row | Action|Adventure|Fantasy|Sci-Fi |
|---|---|
| 2nd row | Action|Adventure|Fantasy |
| 3rd row | Action|Adventure|Thriller |
| 4th row | Action|Thriller |
| 5th row | Action|Adventure|Sci-Fi |
| Value | Count | Frequency (%) |
| drama | 210 | 4.4% |
| comedy | 190 | 4.0% |
| comedy|drama | 182 | 3.8% |
| comedy|drama|romance | 182 | 3.8% |
| comedy|romance | 150 | 3.2% |
| drama|romance | 148 | 3.1% |
| crime|drama|thriller | 95 | 2.0% |
| horror | 65 | 1.4% |
| action|crime|thriller | 64 | 1.3% |
| action|crime|drama|thriller | 64 | 1.3% |
| Other values (868) | 3395 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 9993 | 10.2% |
| | | 9056 | 9.3% |
| a | 8556 | 8.8% |
| e | 7578 | 7.8% |
| m | 6981 | 7.2% |
| i | 6285 | 6.4% |
| o | 6041 | 6.2% |
| y | 4393 | 4.5% |
| n | 4300 | 4.4% |
| t | 3843 | 3.9% |
| Other values (23) | 30499 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 97525 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 9993 | 10.2% |
| | | 9056 | 9.3% |
| a | 8556 | 8.8% |
| e | 7578 | 7.8% |
| m | 6981 | 7.2% |
| i | 6285 | 6.4% |
| o | 6041 | 6.2% |
| y | 4393 | 4.5% |
| n | 4300 | 4.4% |
| t | 3843 | 3.9% |
| Other values (23) | 30499 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 97525 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 9993 | 10.2% |
| | | 9056 | 9.3% |
| a | 8556 | 8.8% |
| e | 7578 | 7.8% |
| m | 6981 | 7.2% |
| i | 6285 | 6.4% |
| o | 6041 | 6.2% |
| y | 4393 | 4.5% |
| n | 4300 | 4.4% |
| t | 3843 | 3.9% |
| Other values (23) | 30499 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 97525 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 9993 | 10.2% |
| | | 9056 | 9.3% |
| a | 8556 | 8.8% |
| e | 7578 | 7.8% |
| m | 6981 | 7.2% |
| i | 6285 | 6.4% |
| o | 6041 | 6.2% |
| y | 4393 | 4.5% |
| n | 4300 | 4.4% |
| t | 3843 | 3.9% |
| Other values (23) | 30499 |
actor_1_name
Text
| Distinct | 1928 |
|---|---|
| Distinct (%) | 40.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 203.2 KiB |
Length
| Max length | 27 |
|---|---|
| Median length | 24 |
| Mean length | 13.179979 |
| Min length | 4 |
Characters and Unicode
| Total characters | 62539 |
|---|---|
| Distinct characters | 75 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1227 ? |
|---|---|
| Unique (%) | 25.9% |
Sample
| 1st row | CCH Pounder |
|---|---|
| 2nd row | Johnny Depp |
| 3rd row | Christoph Waltz |
| 4th row | Tom Hardy |
| 5th row | Daryl Sabara |
| Value | Count | Frequency (%) |
| robert | 107 | 1.1% |
| tom | 91 | 0.9% |
| michael | 83 | 0.8% |
| de | 56 | 0.6% |
| jason | 54 | 0.5% |
| james | 50 | 0.5% |
| steve | 50 | 0.5% |
| bruce | 49 | 0.5% |
| jr | 49 | 0.5% |
| niro | 48 | 0.5% |
| Other values (2681) | 9197 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5878 | 9.4% |
| a | 5359 | 8.6% |
| 5089 | 8.1% | |
| n | 4542 | 7.3% |
| r | 4040 | 6.5% |
| i | 3995 | 6.4% |
| o | 3675 | 5.9% |
| l | 3109 | 5.0% |
| t | 2449 | 3.9% |
| s | 2216 | 3.5% |
| Other values (65) | 22187 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 62539 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 5878 | 9.4% |
| a | 5359 | 8.6% |
| 5089 | 8.1% | |
| n | 4542 | 7.3% |
| r | 4040 | 6.5% |
| i | 3995 | 6.4% |
| o | 3675 | 5.9% |
| l | 3109 | 5.0% |
| t | 2449 | 3.9% |
| s | 2216 | 3.5% |
| Other values (65) | 22187 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 62539 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 5878 | 9.4% |
| a | 5359 | 8.6% |
| 5089 | 8.1% | |
| n | 4542 | 7.3% |
| r | 4040 | 6.5% |
| i | 3995 | 6.4% |
| o | 3675 | 5.9% |
| l | 3109 | 5.0% |
| t | 2449 | 3.9% |
| s | 2216 | 3.5% |
| Other values (65) | 22187 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 62539 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 5878 | 9.4% |
| a | 5359 | 8.6% |
| 5089 | 8.1% | |
| n | 4542 | 7.3% |
| r | 4040 | 6.5% |
| i | 3995 | 6.4% |
| o | 3675 | 5.9% |
| l | 3109 | 5.0% |
| t | 2449 | 3.9% |
| s | 2216 | 3.5% |
| Other values (65) | 22187 |
movie_title
Text
| Distinct | 4624 |
|---|---|
| Distinct (%) | 97.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 203.2 KiB |
Length
| Max length | 87 |
|---|---|
| Median length | 59 |
| Mean length | 16.291254 |
| Min length | 2 |
Characters and Unicode
| Total characters | 77302 |
|---|---|
| Distinct characters | 92 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4509 ? |
|---|---|
| Unique (%) | 95.0% |
Sample
| 1st row | Avatar |
|---|---|
| 2nd row | Pirates of the Caribbean: At World's End |
| 3rd row | Spectre |
| 4th row | The Dark Knight Rises |
| 5th row | John Carter |
| Value | Count | Frequency (%) |
| the | 1513 | 11.5% |
| of | 452 | 3.4% |
| a | 178 | 1.4% |
| and | 138 | 1.1% |
| in | 115 | 0.9% |
| 2 | 104 | 0.8% |
| to | 99 | 0.8% |
| 76 | 0.6% | |
| man | 64 | 0.5% |
| love | 53 | 0.4% |
| Other values (4679) | 10342 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8389 | 10.9% | |
| e | 7420 | 9.6% |
| Â | 4745 | 6.1% |
| a | 4561 | 5.9% |
| o | 4390 | 5.7% |
| r | 3892 | 5.0% |
| n | 3881 | 5.0% |
| i | 3714 | 4.8% |
| t | 3590 | 4.6% |
| s | 2841 | 3.7% |
| Other values (82) | 29879 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 77302 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 8389 | 10.9% | |
| e | 7420 | 9.6% |
| Â | 4745 | 6.1% |
| a | 4561 | 5.9% |
| o | 4390 | 5.7% |
| r | 3892 | 5.0% |
| n | 3881 | 5.0% |
| i | 3714 | 4.8% |
| t | 3590 | 4.6% |
| s | 2841 | 3.7% |
| Other values (82) | 29879 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 77302 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 8389 | 10.9% | |
| e | 7420 | 9.6% |
| Â | 4745 | 6.1% |
| a | 4561 | 5.9% |
| o | 4390 | 5.7% |
| r | 3892 | 5.0% |
| n | 3881 | 5.0% |
| i | 3714 | 4.8% |
| t | 3590 | 4.6% |
| s | 2841 | 3.7% |
| Other values (82) | 29879 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 77302 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 8389 | 10.9% | |
| e | 7420 | 9.6% |
| Â | 4745 | 6.1% |
| a | 4561 | 5.9% |
| o | 4390 | 5.7% |
| r | 3892 | 5.0% |
| n | 3881 | 5.0% |
| i | 3714 | 4.8% |
| t | 3590 | 4.6% |
| s | 2841 | 3.7% |
| Other values (82) | 29879 |
num_voted_users
Real number (ℝ)
| Distinct | 4593 |
|---|---|
| Distinct (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88006.199 |
| Minimum | 5 |
|---|---|
| Maximum | 1689764 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 1106 |
| Q1 | 10832 |
| median | 38142 |
| Q3 | 102123 |
| 95-th percentile | 343573.2 |
| Maximum | 1689764 |
| Range | 1689759 |
| Interquartile range (IQR) | 91291 |
Descriptive statistics
| Standard deviation | 141146.8 |
|---|---|
| Coefficient of variation (CV) | 1.6038279 |
| Kurtosis | 23.484704 |
| Mean | 88006.199 |
| Median Absolute Deviation (MAD) | 32988 |
| Skewness | 3.9503719 |
| Sum | 4.1758942 × 108 |
| Variance | 1.9922418 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 374 | 3 | 0.1% |
| 3119 | 3 | 0.1% |
| 3665 | 3 | 0.1% |
| 6025 | 3 | 0.1% |
| 2541 | 3 | 0.1% |
| 3943 | 2 | < 0.1% |
| 53 | 2 | < 0.1% |
| 806 | 2 | < 0.1% |
| 25870 | 2 | < 0.1% |
| 3662 | 2 | < 0.1% |
| Other values (4583) | 4720 |
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 19 | 1 | |
| 28 | 1 | |
| 37 | 1 | |
| 40 | 1 | |
| 47 | 1 | |
| 48 | 1 | |
| 50 | 1 | |
| 53 | 2 | |
| 59 | 1 |
| Value | Count | Frequency (%) |
| 1689764 | 1 | |
| 1676169 | 1 | |
| 1468200 | 1 | |
| 1347461 | 1 | |
| 1324680 | 1 | |
| 1251222 | 1 | |
| 1238746 | 1 | |
| 1217752 | 1 | |
| 1215718 | 1 | |
| 1155770 | 1 |
cast_total_fb_likes
Real number (ℝ)
| Distinct | 3841 |
|---|---|
| Distinct (%) | 80.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10097.163 |
| Minimum | 0 |
|---|---|
| Maximum | 656730 |
| Zeros | 14 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 243 |
| Q1 | 1509 |
| median | 3233 |
| Q3 | 14486 |
| 95-th percentile | 37757.4 |
| Maximum | 656730 |
| Range | 656730 |
| Interquartile range (IQR) | 12977 |
Descriptive statistics
| Standard deviation | 18236.402 |
|---|---|
| Coefficient of variation (CV) | 1.8060916 |
| Kurtosis | 369.19477 |
| Mean | 10097.163 |
| Median Absolute Deviation (MAD) | 2424 |
| Skewness | 12.857413 |
| Sum | 47911040 |
| Variance | 3.3256636 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 14 | 0.3% |
| 2020 | 6 | 0.1% |
| 1044 | 5 | 0.1% |
| 673 | 5 | 0.1% |
| 29 | 5 | 0.1% |
| 1554 | 4 | 0.1% |
| 2486 | 4 | 0.1% |
| 2321 | 4 | 0.1% |
| 1136 | 4 | 0.1% |
| 2348 | 4 | 0.1% |
| Other values (3831) | 4690 |
| Value | Count | Frequency (%) |
| 0 | 14 | |
| 2 | 4 | 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 3 | 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 11 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 656730 | 1 | |
| 303717 | 1 | |
| 283939 | 1 | |
| 263584 | 1 | |
| 170118 | 1 | |
| 140268 | 1 | |
| 137712 | 1 | |
| 120797 | 1 | |
| 108016 | 1 | |
| 106759 | 1 |
actor_3_name
Text
| Distinct | 3312 |
|---|---|
| Distinct (%) | 69.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 203.2 KiB |
Length
| Max length | 29 |
|---|---|
| Median length | 25 |
| Mean length | 13.073551 |
| Min length | 3 |
Characters and Unicode
| Total characters | 62034 |
|---|---|
| Distinct characters | 81 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2474 ? |
|---|---|
| Unique (%) | 52.1% |
Sample
| 1st row | Wes Studi |
|---|---|
| 2nd row | Jack Davenport |
| 3rd row | Stephanie Sigman |
| 4th row | Joseph Gordon-Levitt |
| 5th row | Polly Walker |
| Value | Count | Frequency (%) |
| michael | 79 | 0.8% |
| john | 73 | 0.7% |
| david | 68 | 0.7% |
| james | 64 | 0.7% |
| robert | 46 | 0.5% |
| kevin | 39 | 0.4% |
| paul | 39 | 0.4% |
| tom | 38 | 0.4% |
| peter | 37 | 0.4% |
| steve | 36 | 0.4% |
| Other values (4097) | 9311 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5867 | 9.5% |
| a | 5643 | 9.1% |
| 5085 | 8.2% | |
| n | 4338 | 7.0% |
| r | 3959 | 6.4% |
| i | 3752 | 6.0% |
| o | 3370 | 5.4% |
| l | 3321 | 5.4% |
| t | 2224 | 3.6% |
| s | 2209 | 3.6% |
| Other values (71) | 22266 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 62034 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 5867 | 9.5% |
| a | 5643 | 9.1% |
| 5085 | 8.2% | |
| n | 4338 | 7.0% |
| r | 3959 | 6.4% |
| i | 3752 | 6.0% |
| o | 3370 | 5.4% |
| l | 3321 | 5.4% |
| t | 2224 | 3.6% |
| s | 2209 | 3.6% |
| Other values (71) | 22266 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 62034 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 5867 | 9.5% |
| a | 5643 | 9.1% |
| 5085 | 8.2% | |
| n | 4338 | 7.0% |
| r | 3959 | 6.4% |
| i | 3752 | 6.0% |
| o | 3370 | 5.4% |
| l | 3321 | 5.4% |
| t | 2224 | 3.6% |
| s | 2209 | 3.6% |
| Other values (71) | 22266 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 62034 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 5867 | 9.5% |
| a | 5643 | 9.1% |
| 5085 | 8.2% | |
| n | 4338 | 7.0% |
| r | 3959 | 6.4% |
| i | 3752 | 6.0% |
| o | 3370 | 5.4% |
| l | 3321 | 5.4% |
| t | 2224 | 3.6% |
| s | 2209 | 3.6% |
| Other values (71) | 22266 |
facenumber_in_poster
Real number (ℝ)
ZEROS 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3595364 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 2035 |
| Zeros (%) | 42.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.0082372 |
|---|---|
| Coefficient of variation (CV) | 1.4771486 |
| Kurtosis | 55.329778 |
| Mean | 1.3595364 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.5395464 |
| Sum | 6451 |
| Variance | 4.0330166 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2035 | |
| 1 | 1186 | |
| 2 | 673 | 14.2% |
| 3 | 364 | 7.7% |
| 4 | 193 | 4.1% |
| 5 | 101 | 2.1% |
| 6 | 68 | 1.4% |
| 7 | 45 | 0.9% |
| 8 | 34 | 0.7% |
| 9 | 16 | 0.3% |
| Other values (9) | 30 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 2035 | |
| 1 | 1186 | |
| 2 | 673 | 14.2% |
| 3 | 364 | 7.7% |
| 4 | 193 | 4.1% |
| 5 | 101 | 2.1% |
| 6 | 68 | 1.4% |
| 7 | 45 | 0.9% |
| 8 | 34 | 0.7% |
| 9 | 16 | 0.3% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 15 | 5 | 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 12 | 4 | 0.1% |
| 11 | 5 | 0.1% |
| 10 | 10 | |
| 9 | 16 |
plot_keywords
Text
| Distinct | 4620 |
|---|---|
| Distinct (%) | 97.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 203.2 KiB |
Length
| Max length | 149 |
|---|---|
| Median length | 102 |
| Mean length | 52.409905 |
| Min length | 2 |
Characters and Unicode
| Total characters | 248685 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4503 ? |
|---|---|
| Unique (%) | 94.9% |
Sample
| 1st row | avatar|future|marine|native|paraplegic |
|---|---|
| 2nd row | goddess|marriage ceremony|marriage proposal|pirate|singapore |
| 3rd row | bomb|espionage|sequel|spy|terrorist |
| 4th row | deception|imprisonment|lawlessness|police officer|terrorist plot |
| 5th row | alien|american civil war|male nipple|mars|princess |
| Value | Count | Frequency (%) |
| in | 314 | 1.8% |
| of | 215 | 1.2% |
| on | 196 | 1.1% |
| the | 187 | 1.1% |
| a | 182 | 1.0% |
| to | 178 | 1.0% |
| york | 120 | 0.7% |
| female | 102 | 0.6% |
| by | 99 | 0.6% |
| based | 98 | 0.6% |
| Other values (11164) | 15734 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 24053 | 9.7% |
| a | 18996 | 7.6% |
| | | 18684 | 7.5% |
| i | 18136 | 7.3% |
| r | 17581 | 7.1% |
| t | 15663 | 6.3% |
| n | 15225 | 6.1% |
| o | 15001 | 6.0% |
| s | 12894 | 5.2% |
| 12680 | 5.1% | |
| Other values (32) | 79772 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 248685 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 24053 | 9.7% |
| a | 18996 | 7.6% |
| | | 18684 | 7.5% |
| i | 18136 | 7.3% |
| r | 17581 | 7.1% |
| t | 15663 | 6.3% |
| n | 15225 | 6.1% |
| o | 15001 | 6.0% |
| s | 12894 | 5.2% |
| 12680 | 5.1% | |
| Other values (32) | 79772 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 248685 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 24053 | 9.7% |
| a | 18996 | 7.6% |
| | | 18684 | 7.5% |
| i | 18136 | 7.3% |
| r | 17581 | 7.1% |
| t | 15663 | 6.3% |
| n | 15225 | 6.1% |
| o | 15001 | 6.0% |
| s | 12894 | 5.2% |
| 12680 | 5.1% | |
| Other values (32) | 79772 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 248685 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 24053 | 9.7% |
| a | 18996 | 7.6% |
| | | 18684 | 7.5% |
| i | 18136 | 7.3% |
| r | 17581 | 7.1% |
| t | 15663 | 6.3% |
| n | 15225 | 6.1% |
| o | 15001 | 6.0% |
| s | 12894 | 5.2% |
| 12680 | 5.1% | |
| Other values (32) | 79772 |
num_user_for_reviews
Real number (ℝ)
| Distinct | 953 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 285.58714 |
| Minimum | 1 |
|---|---|
| Maximum | 5060 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 75 |
| median | 167 |
| Q3 | 341 |
| 95-th percentile | 928 |
| Maximum | 5060 |
| Range | 5059 |
| Interquartile range (IQR) | 266 |
Descriptive statistics
| Standard deviation | 383.9471 |
|---|---|
| Coefficient of variation (CV) | 1.3444131 |
| Kurtosis | 25.692807 |
| Mean | 285.58714 |
| Median Absolute Deviation (MAD) | 114 |
| Skewness | 4.0715798 |
| Sum | 1355111 |
| Variance | 147415.37 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26 | 30 | 0.6% |
| 50 | 25 | 0.5% |
| 32 | 24 | 0.5% |
| 14 | 22 | 0.5% |
| 53 | 22 | 0.5% |
| 31 | 22 | 0.5% |
| 21 | 22 | 0.5% |
| 27 | 21 | 0.4% |
| 10 | 21 | 0.4% |
| 39 | 21 | 0.4% |
| Other values (943) | 4515 |
| Value | Count | Frequency (%) |
| 1 | 18 | |
| 2 | 10 | |
| 3 | 17 | |
| 4 | 11 | |
| 5 | 14 | |
| 6 | 17 | |
| 7 | 12 | |
| 8 | 14 | |
| 9 | 17 | |
| 10 | 21 |
| Value | Count | Frequency (%) |
| 5060 | 1 | |
| 4667 | 1 | |
| 4144 | 1 | |
| 3646 | 1 | |
| 3597 | 1 | |
| 3516 | 1 | |
| 3400 | 1 | |
| 3286 | 1 | |
| 3189 | 1 | |
| 3054 | 1 |
country
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 203.2 KiB |
| USA | |
|---|---|
| Other | |
| UK |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.2109589 |
| Min length | 2 |
Characters and Unicode
| Total characters | 15236 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | USA |
| 3rd row | UK |
| 4th row | USA |
| 5th row | USA |
Common Values
| Value | Count | Frequency (%) |
| USA | 3607 | |
| Other | 713 | 15.0% |
| UK | 425 | 9.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| usa | 3607 | |
| other | 713 | 15.0% |
| uk | 425 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 4032 | |
| S | 3607 | |
| A | 3607 | |
| O | 713 | 4.7% |
| t | 713 | 4.7% |
| h | 713 | 4.7% |
| e | 713 | 4.7% |
| r | 713 | 4.7% |
| K | 425 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15236 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| U | 4032 | |
| S | 3607 | |
| A | 3607 | |
| O | 713 | 4.7% |
| t | 713 | 4.7% |
| h | 713 | 4.7% |
| e | 713 | 4.7% |
| r | 713 | 4.7% |
| K | 425 | 2.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15236 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| U | 4032 | |
| S | 3607 | |
| A | 3607 | |
| O | 713 | 4.7% |
| t | 713 | 4.7% |
| h | 713 | 4.7% |
| e | 713 | 4.7% |
| r | 713 | 4.7% |
| K | 425 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15236 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| U | 4032 | |
| S | 3607 | |
| A | 3607 | |
| O | 713 | 4.7% |
| t | 713 | 4.7% |
| h | 713 | 4.7% |
| e | 713 | 4.7% |
| r | 713 | 4.7% |
| K | 425 | 2.8% |
content_rating
Categorical
IMBALANCE 
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 203.2 KiB |
| R | |
|---|---|
| PG-13 | |
| PG | |
| G | 109 |
| Not Rated | 102 |
| Other values (10) | 160 |
Length
| Max length | 9 |
|---|---|
| Median length | 1 |
| Mean length | 2.7028451 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12825 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PG-13 |
|---|---|
| 2nd row | PG-13 |
| 3rd row | PG-13 |
| 4th row | PG-13 |
| 5th row | PG-13 |
Common Values
| Value | Count | Frequency (%) |
| R | 2255 | |
| PG-13 | 1436 | |
| PG | 683 | 14.4% |
| G | 109 | 2.3% |
| Not Rated | 102 | 2.1% |
| Unrated | 58 | 1.2% |
| Approved | 55 | 1.2% |
| X | 13 | 0.3% |
| Passed | 9 | 0.2% |
| NC-17 | 7 | 0.1% |
| Other values (5) | 18 | 0.4% |
Length
| Value | Count | Frequency (%) |
| r | 2255 | |
| pg-13 | 1436 | |
| pg | 683 | 14.1% |
| g | 109 | 2.2% |
| not | 102 | 2.1% |
| rated | 102 | 2.1% |
| unrated | 58 | 1.2% |
| approved | 55 | 1.1% |
| x | 13 | 0.3% |
| passed | 9 | 0.2% |
| Other values (6) | 25 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 2357 | |
| G | 2238 | |
| P | 2135 | |
| - | 1450 | |
| 1 | 1446 | |
| 3 | 1436 | |
| t | 262 | 2.0% |
| e | 224 | 1.7% |
| d | 224 | 1.7% |
| a | 169 | 1.3% |
| Other values (17) | 884 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12825 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| R | 2357 | |
| G | 2238 | |
| P | 2135 | |
| - | 1450 | |
| 1 | 1446 | |
| 3 | 1436 | |
| t | 262 | 2.0% |
| e | 224 | 1.7% |
| d | 224 | 1.7% |
| a | 169 | 1.3% |
| Other values (17) | 884 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12825 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| R | 2357 | |
| G | 2238 | |
| P | 2135 | |
| - | 1450 | |
| 1 | 1446 | |
| 3 | 1436 | |
| t | 262 | 2.0% |
| e | 224 | 1.7% |
| d | 224 | 1.7% |
| a | 169 | 1.3% |
| Other values (17) | 884 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12825 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| R | 2357 | |
| G | 2238 | |
| P | 2135 | |
| - | 1450 | |
| 1 | 1446 | |
| 3 | 1436 | |
| t | 262 | 2.0% |
| e | 224 | 1.7% |
| d | 224 | 1.7% |
| a | 169 | 1.3% |
| Other values (17) | 884 | 6.9% |
budget
Real number (ℝ)
SKEWED 
| Distinct | 432 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39308861 |
| Minimum | 218 |
|---|---|
| Maximum | 1.22155 × 1010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 218 |
|---|---|
| 5-th percentile | 800000 |
| Q1 | 7500000 |
| median | 20000000 |
| Q3 | 40000000 |
| 95-th percentile | 1.25 × 108 |
| Maximum | 1.22155 × 1010 |
| Range | 1.22155 × 1010 |
| Interquartile range (IQR) | 32500000 |
Descriptive statistics
| Standard deviation | 2.0182661 × 108 |
|---|---|
| Coefficient of variation (CV) | 5.1343796 |
| Kurtosis | 2842.5051 |
| Mean | 39308861 |
| Median Absolute Deviation (MAD) | 15000000 |
| Skewness | 49.200958 |
| Sum | 1.8652054 × 1011 |
| Variance | 4.0733982 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20000000 | 448 | 9.4% |
| 30000000 | 146 | 3.1% |
| 15000000 | 143 | 3.0% |
| 25000000 | 141 | 3.0% |
| 10000000 | 137 | 2.9% |
| 40000000 | 132 | 2.8% |
| 35000000 | 121 | 2.6% |
| 50000000 | 104 | 2.2% |
| 5000000 | 103 | 2.2% |
| 60000000 | 94 | 2.0% |
| Other values (422) | 3176 |
| Value | Count | Frequency (%) |
| 218 | 1 | < 0.1% |
| 1100 | 1 | < 0.1% |
| 4500 | 1 | < 0.1% |
| 7000 | 3 | |
| 9000 | 1 | < 0.1% |
| 10000 | 2 | |
| 14000 | 1 | < 0.1% |
| 15000 | 2 | |
| 20000 | 3 | |
| 22000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.22155 × 1010 | 1 | |
| 4200000000 | 1 | |
| 2500000000 | 1 | |
| 2400000000 | 1 | |
| 2127519898 | 1 | |
| 1100000000 | 1 | |
| 1000000000 | 1 | |
| 700000000 | 2 | |
| 600000000 | 1 | |
| 553632000 | 1 |
title_year
Real number (ℝ)
| Distinct | 91 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2002.1144 |
| Minimum | 1916 |
|---|---|
| Maximum | 2016 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 1916 |
|---|---|
| 5-th percentile | 1978 |
| Q1 | 1999 |
| median | 2005 |
| Q3 | 2010 |
| 95-th percentile | 2015 |
| Maximum | 2016 |
| Range | 100 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 12.502688 |
|---|---|
| Coefficient of variation (CV) | 0.0062447421 |
| Kurtosis | 7.3303826 |
| Mean | 2002.1144 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -2.2773413 |
| Sum | 9500033 |
| Variance | 156.31721 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2009 | 254 | 5.4% |
| 2006 | 236 | 5.0% |
| 2010 | 222 | 4.7% |
| 2008 | 222 | 4.7% |
| 2014 | 218 | 4.6% |
| 2011 | 215 | 4.5% |
| 2005 | 215 | 4.5% |
| 2013 | 214 | 4.5% |
| 2004 | 209 | 4.4% |
| 2012 | 206 | 4.3% |
| Other values (81) | 2534 |
| Value | Count | Frequency (%) |
| 1916 | 1 | |
| 1920 | 1 | |
| 1925 | 1 | |
| 1927 | 1 | |
| 1929 | 2 | |
| 1930 | 1 | |
| 1932 | 1 | |
| 1933 | 2 | |
| 1934 | 1 | |
| 1935 | 1 |
| Value | Count | Frequency (%) |
| 2016 | 85 | 1.8% |
| 2015 | 186 | |
| 2014 | 218 | |
| 2013 | 214 | |
| 2012 | 206 | |
| 2011 | 215 | |
| 2010 | 222 | |
| 2009 | 254 | |
| 2008 | 222 | |
| 2007 | 199 |
actor_2_fb_likes
Real number (ℝ)
| Distinct | 906 |
|---|---|
| Distinct (%) | 19.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1727.8936 |
| Minimum | 0 |
|---|---|
| Maximum | 137000 |
| Zeros | 32 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 300 |
| median | 617 |
| Q3 | 931 |
| 95-th percentile | 11000 |
| Maximum | 137000 |
| Range | 137000 |
| Interquartile range (IQR) | 631 |
Descriptive statistics
| Standard deviation | 4148.4537 |
|---|---|
| Coefficient of variation (CV) | 2.4008734 |
| Kurtosis | 244.61784 |
| Mean | 1727.8936 |
| Median Absolute Deviation (MAD) | 316 |
| Skewness | 9.6427963 |
| Sum | 8198855 |
| Variance | 17209668 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 302 | 6.4% |
| 11000 | 111 | 2.3% |
| 2000 | 98 | 2.1% |
| 3000 | 75 | 1.6% |
| 10000 | 47 | 1.0% |
| 14000 | 41 | 0.9% |
| 13000 | 40 | 0.8% |
| 826 | 37 | 0.8% |
| 4000 | 33 | 0.7% |
| 0 | 32 | 0.7% |
| Other values (896) | 3929 |
| Value | Count | Frequency (%) |
| 0 | 32 | |
| 2 | 11 | 0.2% |
| 3 | 9 | 0.2% |
| 4 | 11 | 0.2% |
| 5 | 8 | 0.2% |
| 6 | 7 | 0.1% |
| 7 | 2 | < 0.1% |
| 8 | 9 | 0.2% |
| 9 | 12 | 0.3% |
| 10 | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 137000 | 1 | < 0.1% |
| 29000 | 1 | < 0.1% |
| 27000 | 2 | < 0.1% |
| 25000 | 3 | 0.1% |
| 23000 | 6 | |
| 22000 | 11 | |
| 21000 | 4 | 0.1% |
| 20000 | 6 | |
| 19000 | 7 | |
| 18000 | 9 |
imdb_score
Real number (ℝ)
| Distinct | 76 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.4322234 |
| Minimum | 1.6 |
|---|---|
| Maximum | 9.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 1.6 |
|---|---|
| 5-th percentile | 4.4 |
| Q1 | 5.8 |
| median | 6.6 |
| Q3 | 7.2 |
| 95-th percentile | 8 |
| Maximum | 9.3 |
| Range | 7.7 |
| Interquartile range (IQR) | 1.4 |
Descriptive statistics
| Standard deviation | 1.1002781 |
|---|---|
| Coefficient of variation (CV) | 0.1710572 |
| Kurtosis | 1.0444812 |
| Mean | 6.4322234 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | -0.76831284 |
| Sum | 30520.9 |
| Variance | 1.2106119 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.7 | 214 | 4.5% |
| 6.6 | 192 | 4.0% |
| 7.2 | 183 | 3.9% |
| 6.5 | 183 | 3.9% |
| 6.4 | 180 | 3.8% |
| 6.8 | 178 | 3.8% |
| 7 | 176 | 3.7% |
| 7.3 | 175 | 3.7% |
| 6.1 | 174 | 3.7% |
| 7.1 | 173 | 3.6% |
| Other values (66) | 2917 |
| Value | Count | Frequency (%) |
| 1.6 | 1 | < 0.1% |
| 1.7 | 1 | < 0.1% |
| 1.9 | 3 | |
| 2 | 2 | |
| 2.1 | 3 | |
| 2.2 | 3 | |
| 2.3 | 3 | |
| 2.4 | 2 | |
| 2.5 | 2 | |
| 2.6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.3 | 1 | < 0.1% |
| 9.2 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8.9 | 5 | 0.1% |
| 8.8 | 5 | 0.1% |
| 8.7 | 8 | 0.2% |
| 8.6 | 11 | 0.2% |
| 8.5 | 21 | |
| 8.4 | 23 | |
| 8.3 | 35 |
aspect_ratio
Real number (ℝ)
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1256797 |
| Minimum | 1.18 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 1.18 |
|---|---|
| 5-th percentile | 1.78 |
| Q1 | 1.85 |
| median | 2.35 |
| Q3 | 2.35 |
| 95-th percentile | 2.35 |
| Maximum | 16 |
| Range | 14.82 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.63597663 |
|---|---|
| Coefficient of variation (CV) | 0.29918743 |
| Kurtosis | 379.54111 |
| Mean | 2.1256797 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 17.448477 |
| Sum | 10086.35 |
| Variance | 0.40446628 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.35 | 2523 | |
| 1.85 | 1886 | |
| 1.37 | 97 | 2.0% |
| 1.78 | 80 | 1.7% |
| 1.66 | 63 | 1.3% |
| 1.33 | 37 | 0.8% |
| 2.2 | 15 | 0.3% |
| 2.39 | 14 | 0.3% |
| 16 | 8 | 0.2% |
| 2 | 4 | 0.1% |
| Other values (10) | 18 | 0.4% |
| Value | Count | Frequency (%) |
| 1.18 | 1 | < 0.1% |
| 1.2 | 1 | < 0.1% |
| 1.33 | 37 | 0.8% |
| 1.37 | 97 | |
| 1.44 | 1 | < 0.1% |
| 1.5 | 2 | < 0.1% |
| 1.66 | 63 | |
| 1.75 | 3 | 0.1% |
| 1.77 | 1 | < 0.1% |
| 1.78 | 80 |
| Value | Count | Frequency (%) |
| 16 | 8 | 0.2% |
| 2.76 | 3 | 0.1% |
| 2.55 | 2 | < 0.1% |
| 2.4 | 3 | 0.1% |
| 2.39 | 14 | 0.3% |
| 2.35 | 2523 | |
| 2.24 | 1 | < 0.1% |
| 2.2 | 15 | 0.3% |
| 2 | 4 | 0.1% |
| 1.85 | 1886 |
movie_fb_likes
Real number (ℝ)
ZEROS 
| Distinct | 836 |
|---|---|
| Distinct (%) | 17.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7822.658 |
| Minimum | 0 |
|---|---|
| Maximum | 349000 |
| Zeros | 2104 |
| Zeros (%) | 44.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 203.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 181 |
| Q3 | 5000 |
| 95-th percentile | 41800 |
| Maximum | 349000 |
| Range | 349000 |
| Interquartile range (IQR) | 5000 |
Descriptive statistics
| Standard deviation | 19644.251 |
|---|---|
| Coefficient of variation (CV) | 2.511199 |
| Kurtosis | 39.865391 |
| Mean | 7822.658 |
| Median Absolute Deviation (MAD) | 181 |
| Skewness | 4.9492684 |
| Sum | 37118512 |
| Variance | 3.8589659 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2104 | |
| 1000 | 103 | 2.2% |
| 11000 | 81 | 1.7% |
| 10000 | 79 | 1.7% |
| 12000 | 60 | 1.3% |
| 13000 | 58 | 1.2% |
| 2000 | 54 | 1.1% |
| 15000 | 52 | 1.1% |
| 14000 | 47 | 1.0% |
| 16000 | 46 | 1.0% |
| Other values (826) | 2061 |
| Value | Count | Frequency (%) |
| 0 | 2104 | |
| 4 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| 7 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 349000 | 1 | |
| 199000 | 1 | |
| 197000 | 1 | |
| 191000 | 1 | |
| 190000 | 1 | |
| 175000 | 1 | |
| 165000 | 1 | |
| 164000 | 1 | |
| 153000 | 1 | |
| 150000 | 1 |
| director_name | num_critic_for_reviews | duration | director_fb_likes | actor_3_fb_likes | actor_2_name | actor_1_fb_likes | gross | genres | actor_1_name | movie_title | num_voted_users | cast_total_fb_likes | actor_3_name | facenumber_in_poster | plot_keywords | num_user_for_reviews | country | content_rating | budget | title_year | actor_2_fb_likes | imdb_score | aspect_ratio | movie_fb_likes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | James Cameron | 723.0 | 178.0 | 0.0 | 855.0 | Joel David Moore | 1000.0 | 760505847.0 | Action|Adventure|Fantasy|Sci-Fi | CCH Pounder | Avatar | 886204 | 4834 | Wes Studi | 0.0 | avatar|future|marine|native|paraplegic | 3054.0 | USA | PG-13 | 237000000.0 | 2009.0 | 936.0 | 7.9 | 1.78 | 33000 |
| 1 | Gore Verbinski | 302.0 | 169.0 | 563.0 | 1000.0 | Orlando Bloom | 40000.0 | 309404152.0 | Action|Adventure|Fantasy | Johnny Depp | Pirates of the Caribbean: At World's End | 471220 | 48350 | Jack Davenport | 0.0 | goddess|marriage ceremony|marriage proposal|pirate|singapore | 1238.0 | USA | PG-13 | 300000000.0 | 2007.0 | 5000.0 | 7.1 | 2.35 | 0 |
| 2 | Sam Mendes | 602.0 | 148.0 | 0.0 | 161.0 | Rory Kinnear | 11000.0 | 200074175.0 | Action|Adventure|Thriller | Christoph Waltz | Spectre | 275868 | 11700 | Stephanie Sigman | 1.0 | bomb|espionage|sequel|spy|terrorist | 994.0 | UK | PG-13 | 245000000.0 | 2015.0 | 393.0 | 6.8 | 2.35 | 85000 |
| 3 | Christopher Nolan | 813.0 | 164.0 | 22000.0 | 23000.0 | Christian Bale | 27000.0 | 448130642.0 | Action|Thriller | Tom Hardy | The Dark Knight Rises | 1144337 | 106759 | Joseph Gordon-Levitt | 0.0 | deception|imprisonment|lawlessness|police officer|terrorist plot | 2701.0 | USA | PG-13 | 250000000.0 | 2012.0 | 23000.0 | 8.5 | 2.35 | 164000 |
| 5 | Andrew Stanton | 462.0 | 132.0 | 475.0 | 530.0 | Samantha Morton | 640.0 | 73058679.0 | Action|Adventure|Sci-Fi | Daryl Sabara | John Carter | 212204 | 1873 | Polly Walker | 1.0 | alien|american civil war|male nipple|mars|princess | 738.0 | USA | PG-13 | 263700000.0 | 2012.0 | 632.0 | 6.6 | 2.35 | 24000 |
| 6 | Sam Raimi | 392.0 | 156.0 | 0.0 | 4000.0 | James Franco | 24000.0 | 336530303.0 | Action|Adventure|Romance | J.K. Simmons | Spider-Man 3 | 383056 | 46055 | Kirsten Dunst | 0.0 | sandman|spider man|symbiote|venom|villain | 1902.0 | USA | PG-13 | 258000000.0 | 2007.0 | 11000.0 | 6.2 | 2.35 | 0 |
| 7 | Nathan Greno | 324.0 | 100.0 | 15.0 | 284.0 | Donna Murphy | 799.0 | 200807262.0 | Adventure|Animation|Comedy|Family|Fantasy|Musical|Romance | Brad Garrett | Tangled | 294810 | 2036 | M.C. Gainey | 1.0 | 17th century|based on fairy tale|disney|flower|tower | 387.0 | USA | PG | 260000000.0 | 2010.0 | 553.0 | 7.8 | 1.85 | 29000 |
| 8 | Joss Whedon | 635.0 | 141.0 | 0.0 | 19000.0 | Robert Downey Jr. | 26000.0 | 458991599.0 | Action|Adventure|Sci-Fi | Chris Hemsworth | Avengers: Age of Ultron | 462669 | 92000 | Scarlett Johansson | 4.0 | artificial intelligence|based on comic book|captain america|marvel cinematic universe|superhero | 1117.0 | USA | PG-13 | 250000000.0 | 2015.0 | 21000.0 | 7.5 | 2.35 | 118000 |
| 9 | David Yates | 375.0 | 153.0 | 282.0 | 10000.0 | Daniel Radcliffe | 25000.0 | 301956980.0 | Adventure|Family|Fantasy|Mystery | Alan Rickman | Harry Potter and the Half-Blood Prince | 321795 | 58753 | Rupert Grint | 3.0 | blood|book|love|potion|professor | 973.0 | UK | PG | 250000000.0 | 2009.0 | 11000.0 | 7.5 | 2.35 | 10000 |
| 10 | Zack Snyder | 673.0 | 183.0 | 0.0 | 2000.0 | Lauren Cohan | 15000.0 | 330249062.0 | Action|Adventure|Sci-Fi | Henry Cavill | Batman v Superman: Dawn of Justice | 371639 | 24450 | Alan D. Purwin | 0.0 | based on comic book|batman|sequel to a reboot|superhero|superman | 3018.0 | USA | PG-13 | 250000000.0 | 2016.0 | 4000.0 | 6.9 | 2.35 | 197000 |
| director_name | num_critic_for_reviews | duration | director_fb_likes | actor_3_fb_likes | actor_2_name | actor_1_fb_likes | gross | genres | actor_1_name | movie_title | num_voted_users | cast_total_fb_likes | actor_3_name | facenumber_in_poster | plot_keywords | num_user_for_reviews | country | content_rating | budget | title_year | actor_2_fb_likes | imdb_score | aspect_ratio | movie_fb_likes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5026 | Olivier Assayas | 81.0 | 110.0 | 107.0 | 45.0 | Béatrice Dalle | 576.0 | 136007.0 | Drama|Music|Romance | Maggie Cheung | Clean | 3924 | 776 | Don McKellar | 1.0 | jail|junkie|money|motel|singer | 39.0 | Other | R | 4500.0 | 2004.0 | 133.0 | 6.9 | 2.35 | 171 |
| 5027 | Jafar Panahi | 64.0 | 90.0 | 397.0 | 0.0 | Nargess Mamizadeh | 5.0 | 673780.0 | Drama | Fereshteh Sadre Orafaiy | The Circle | 4555 | 5 | Mojgan Faramarzi | 0.0 | abortion|bus|hospital|prison|prostitution | 26.0 | Other | Not Rated | 10000.0 | 2000.0 | 0.0 | 7.5 | 1.85 | 697 |
| 5029 | Kiyoshi Kurosawa | 78.0 | 111.0 | 62.0 | 6.0 | Anna Nakagawa | 89.0 | 94596.0 | Crime|Horror|Mystery|Thriller | Kôji Yakusho | The Cure | 6318 | 115 | Denden | 0.0 | breasts|interrogation|investigation|murder|watching television | 50.0 | Other | R | 1000000.0 | 1997.0 | 13.0 | 7.4 | 1.85 | 817 |
| 5032 | Ash Baron-Cohen | 10.0 | 98.0 | 3.0 | 152.0 | Stanley B. Herman | 789.0 | 24848292.0 | Crime|Drama | Peter Greene | Bang | 438 | 1186 | James Noble | 1.0 | corruption|homeless|homeless man|motorcycle|urban legend | 14.0 | USA | R | 20000000.0 | 1995.0 | 194.0 | 6.4 | 2.35 | 20 |
| 5033 | Shane Carruth | 143.0 | 77.0 | 291.0 | 8.0 | David Sullivan | 291.0 | 424760.0 | Drama|Sci-Fi|Thriller | Shane Carruth | Primer | 72639 | 368 | Casey Gooden | 0.0 | changing the future|independent film|invention|nonlinear timeline|time travel | 371.0 | USA | PG-13 | 7000.0 | 2004.0 | 45.0 | 7.0 | 1.85 | 19000 |
| 5034 | Neill Dela Llana | 35.0 | 80.0 | 0.0 | 0.0 | Edgar Tancangco | 0.0 | 70071.0 | Thriller | Ian Gamazon | Cavite | 589 | 0 | Quynn Ton | 0.0 | jihad|mindanao|philippines|security guard|squatter | 35.0 | Other | Not Rated | 7000.0 | 2005.0 | 0.0 | 6.3 | 2.35 | 74 |
| 5035 | Robert Rodriguez | 56.0 | 81.0 | 0.0 | 6.0 | Peter Marquardt | 121.0 | 2040920.0 | Action|Crime|Drama|Romance|Thriller | Carlos Gallardo | El Mariachi | 52055 | 147 | Consuelo Gómez | 0.0 | assassin|death|guitar|gun|mariachi | 130.0 | USA | R | 7000.0 | 1992.0 | 20.0 | 6.9 | 1.37 | 0 |
| 5037 | Edward Burns | 14.0 | 95.0 | 0.0 | 133.0 | Caitlin FitzGerald | 296.0 | 4584.0 | Comedy|Drama | Kerry Bishé | Newlyweds | 1338 | 690 | Daniella Pineda | 1.0 | written and directed by cast member | 14.0 | USA | Not Rated | 9000.0 | 2011.0 | 205.0 | 6.4 | 2.35 | 413 |
| 5038 | Scott Smith | 1.0 | 87.0 | 2.0 | 318.0 | Daphne Zuniga | 637.0 | 24848292.0 | Comedy|Drama | Eric Mabius | Signed Sealed Delivered | 629 | 2283 | Crystal Lowe | 2.0 | fraud|postal worker|prison|theft|trial | 6.0 | Other | R | 20000000.0 | 2013.0 | 470.0 | 7.7 | 2.35 | 84 |
| 5042 | Jon Gunn | 43.0 | 90.0 | 16.0 | 16.0 | Brian Herzlinger | 86.0 | 85222.0 | Documentary | John August | My Date with Drew | 4285 | 163 | Jon Gunn | 0.0 | actress name in title|crush|date|four word title|video camera | 84.0 | USA | PG | 1100.0 | 2004.0 | 23.0 | 6.6 | 1.85 | 456 |
Most frequently occurring
| director_name | num_critic_for_reviews | duration | director_fb_likes | actor_3_fb_likes | actor_2_name | actor_1_fb_likes | gross | genres | actor_1_name | movie_title | num_voted_users | cast_total_fb_likes | actor_3_name | facenumber_in_poster | plot_keywords | num_user_for_reviews | country | content_rating | budget | title_year | actor_2_fb_likes | imdb_score | aspect_ratio | movie_fb_likes | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Albert Hughes | 208.0 | 122.0 | 117.0 | 140.0 | Jason Flemyng | 40000.0 | 31598308.0 | Horror|Mystery|Thriller | Johnny Depp | From Hell | 124765 | 41636 | Ian Richardson | 1.0 | freemason|jack the ripper|opium|prostitute|victorian era | 541.0 | USA | R | 35000000.0 | 2001.0 | 1000.0 | 6.8 | 2.35 | 0 | 2 |
| 1 | Angelina Jolie Pitt | 322.0 | 137.0 | 11000.0 | 465.0 | Jack O'Connell | 769.0 | 115603980.0 | Biography|Drama|Sport|War | Finn Wittrock | Unbroken | 103589 | 2938 | Alex Russell | 0.0 | emaciation|male nudity|plane crash|prisoner of war|torture | 351.0 | USA | PG-13 | 65000000.0 | 2014.0 | 698.0 | 7.2 | 2.35 | 35000 | 2 |
| 2 | Bill Condon | 322.0 | 115.0 | 386.0 | 12000.0 | Kristen Stewart | 21000.0 | 292298923.0 | Adventure|Drama|Fantasy|Romance | Robert Pattinson | The Twilight Saga: Breaking Dawn - Part 2 | 185394 | 59177 | Taylor Lautner | 3.0 | battle|friend|super strength|vampire|vision | 329.0 | USA | PG-13 | 120000000.0 | 2012.0 | 17000.0 | 5.5 | 2.35 | 65000 | 2 |
| 3 | Brett Ratner | 245.0 | 101.0 | 420.0 | 467.0 | Rufus Sewell | 12000.0 | 72660029.0 | Action|Adventure | Dwayne Johnson | Hercules | 115687 | 16235 | Ingrid Bolsø Berdal | 0.0 | army|greek mythology|hercules|king|mercenary | 269.0 | USA | PG-13 | 100000000.0 | 2014.0 | 3000.0 | 6.0 | 2.35 | 21000 | 2 |
| 4 | Bruce McCulloch | 52.0 | 85.0 | 54.0 | 455.0 | Megan Mullally | 985.0 | 13973532.0 | Comedy|Crime | Martin Starr | Stealing Harvard | 11211 | 3065 | Chris Penn | 1.0 | black humor|crying during sex|harvard|humor|man with glasses | 92.0 | USA | PG-13 | 25000000.0 | 2002.0 | 637.0 | 5.1 | 1.85 | 215 | 2 |
| 5 | Danny Boyle | 393.0 | 101.0 | 0.0 | 888.0 | Spencer Wilding | 3000.0 | 2319187.0 | Crime|Drama|Mystery|Thriller | Rosario Dawson | Trance | 92640 | 5056 | Tuppence Middleton | 0.0 | amnesia|criminal|heist|hypnotherapy|lost painting | 212.0 | UK | R | 20000000.0 | 2013.0 | 1000.0 | 7.0 | 2.35 | 23000 | 2 |
| 6 | David Hewlett | 8.0 | 88.0 | 686.0 | 405.0 | David Hewlett | 847.0 | 24848292.0 | Comedy | Christopher Judge | A Dog's Breakfast | 3262 | 2364 | Paul McGillion | 2.0 | dog|vegetarian | 46.0 | Other | R | 120000.0 | 2007.0 | 686.0 | 7.0 | 1.78 | 377 | 2 |
| 7 | David Yates | 248.0 | 110.0 | 282.0 | 103.0 | Alexander Skarsgård | 11000.0 | 124051759.0 | Action|Adventure|Drama|Romance | Christoph Waltz | The Legend of Tarzan | 42372 | 21175 | Casper Crump | 2.0 | africa|capture|jungle|male objectification|tarzan | 239.0 | USA | PG-13 | 180000000.0 | 2016.0 | 10000.0 | 6.6 | 2.35 | 29000 | 2 |
| 8 | Frank Oz | 168.0 | 87.0 | 0.0 | 548.0 | Ewen Bremner | 22000.0 | 8579684.0 | Comedy | Peter Dinklage | Death at a Funeral | 89547 | 24324 | Kris Marshall | 0.0 | end credits roll call|four word title|funeral|secret|uncle | 199.0 | USA | R | 9000000.0 | 2007.0 | 557.0 | 7.4 | 1.85 | 0 | 2 |
| 9 | George A. Romero | 284.0 | 96.0 | 0.0 | 56.0 | Duane Jones | 125.0 | 236452.0 | Drama|Horror|Mystery | Judith O'Dea | Night of the Living Dead | 87978 | 403 | S. William Hinzman | 5.0 | cemetery|farmhouse|radiation|running out of gas|zombie | 580.0 | USA | Unrated | 114000.0 | 1968.0 | 108.0 | 8.0 | 1.85 | 0 | 2 |